Communication apparatus, system and web page processing method

ABSTRACT

An apparatus and method for processing a Web page is disclosed. A Web processing unit receives a first Web page from a Web server and displays the first Web page on a screen. The Web processing unit acquires a communication program identified by the first Web page. An external state is monitored and thereby operation information responsive to the external state is generated. The Web processing unit executes the communication program to change the first Web page displayed on the screen to a second Web page according to the operation information.

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is based upon and claims the benefit of priority from the prior Japanese Patent Application No. 2002-112338, filed Apr. 15, 2002, the entire contents of which are incorporated herein by reference.

BACKGROUND OF THE INVENTION

[0002] 1. Field of the Invention

[0003] The present invention relates to a communication apparatus having a function of acquiring a Web page via the Internet, a system, and a Web page processing method.

[0004] 2. Description of the Related Art

[0005] In recent years, digital electric household appliances have become increasingly popular. Such conventional digital household appliances are mainly so-called “stand-alone” devices such as a digital TV, a DVD player and an MP3 player. However, more attention has recently been given to a category called “network electric household appliances.” The network electric household appliances sharply differ from the conventional electric household appliances in that they are mainly Internet-capable household devices. This means that Internet technology, whose main platform has been personal computers, begins to be used in the field of electric household appliances. In near future, Internet technology and personal-computer technology would be positively applied to the field of digital electric household appliances.

[0006] Technology of so-called Web browser is very prevalent in the technical field of personal computers and the Internet. Web browsers are now introduced not only into personal computers but also into most types of mobile phones and some types of televisions. It is widely recognized that the range of applications of Web browsers is very wide. It is expected that Web browser techniques will be applied very widely to electric household appliance controllers, remote controllers, etc.

[0007] One of Web browser techniques, which has drawn special attention, is Java^((R)). As is well known, the use of Java^((R)) can remarkably enhance the performance of expression on the screen displayed on the Web browser, and can realize display as if a computer application were running on the Web browser.

[0008] However, with increasing popularization of Internet-capable digital household appliances in the future, it will become necessary to harmoniously associate various functions required for digital household appliances, such as “voice recognition”, “speech synthesis”, “odor recognition” and “emotion recognition”, with network functions or Web browser functions. These functions need to be dynamically applied to the network household appliances and Web browsers in the future.

BRIEF SUMMARY OF THE INVENTION

[0009] The present invention has been made in consideration of the above circumstances, and its object is to provide a communication apparatus, system, and method capable of performing processing on a Web page in response to information relating to an external condition.

[0010] According to one aspect of the present invention, there is provided an apparatus for processing a Web page. A Web processing unit receives a first Web page from a Web server and displays the first Web page on a screen. The Web processing unit acquires a communication progrant identified by the first Web page. An external state is monitored and thereby operation information responsive to the external state is generated. The Web processing unit executes the communication program to change the first Web page displayed on the screen to a second Web page according to the operation information.

[0011] In one embodiment of the present invention, the communication apparatus can perform a process relating to a Web page in response to information relating to the external state. A user, for example, may create desired state as an external state and cause the communication apparatus to recognize the desired state, instead of effecting desired input using a mouse or a keyboard, thereby causing the communication apparatus to perform a predetermined process relating to a Web page.

[0012] In the other embodiment of the present invention, in a case where a server located at a place remote from the communication apparatus is provided with a function of acquiring information relating to an external state, the communication apparatus can perform a process relating to a Web page, in response to the information on the external state at the remote place.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING

[0013]FIG. 1 shows an example of the structure of a whole network system according to an embodiment of the present invention;

[0014]FIG. 2 shows an example of the internal structure of a household terminal device according to the embodiment;

[0015]FIG. 3 shows an example of a Web page according to the embodiment;

[0016]FIG. 4 is a flow chart showing an example of a processing procedure for a Web page according to the embodiment;

[0017]FIG. 5 is a flow chart showing an example of a processing procedure of a communication applet according to the embodiment;

[0018]FIG. 6 is a flow chart showing an example of a processing procedure of a state-monitor processing section according to the embodiment;

[0019]FIG. 7 is a view for explaining an example of a process flow from a request for a Web page to acquisition of a communication applet in the embodiment;

[0020]FIG. 8 is a view for explaining an example of a predetermined process relating to a Web page, which is based on information relating to an external state in the present embodiment;

[0021]FIG. 9A and FIG. 9B are views for explaining a case of making use of a page comprising a plurality of frames according to the embodiment;

[0022]FIG. 10 is a view for explaining a case of making use of a page comprising a plurality of frames according to the embodiment;

[0023]FIG. 11 is a view for explaining a case of making no use of a page comprising a plurality of frames according to the embodiment;

[0024]FIG. 12 shows another example of the structure of the whole network system according to the embodiment;

[0025]FIG. 13 shows still another example of the structure of the whole network system according to the embodiment;

[0026]FIG. 14 shows still another example of the structure of the whole network system according to the embodiment; and

[0027]FIGS. 15A through 15D are views for explaining variations of a processing engine according to the embodiment.

DETAILED DESCRIPTION OF THE INVENTION

[0028] Embodiments of the present invention will now be described with reference to the accompanying drawings.

[0029]FIG. 1 shows an example of the structure of a whole network system according to an embodiment of the present invention.

[0030] In the example shown in FIG. 1, a terminal device (or a communication apparatus) 1 (e.g. a household appliance controller functioning as a mediator for controlling or monitoring a network electric household appliance connected to a household network) is connected to a household network (not shown) provided in a house (30). The household terminal device 1 is communicable via the household network with, e.g. a server connected to the Internet 8 (directly or via a LAN).

[0031] In FIG. 1, one or more other devices (e.g. another household appliance controller, a network household appliance, etc.) may be connected to the household network in the house (30). The household network may be a wired network, a wireless network, or a wired/wireless hybrid network (in the case of the wired/wireless hybrid network, a device connected to the household network may have only a wired-communication interface, or only a wireless communication interface, or both wired and wireless communication interfaces).

[0032] On the other hand, in the example of FIG. 1, a Web server 2, which is disposed outside the house, is connected to the Internet 8. Of course, another server or a router may be connected to the Internet 8.

[0033] In the present embodiment, the household terminal device 1 has a function (a processing engine (e.g. a voice recognition engine)) of acquiring information relating to an external condition (e.g. a word (a recognition of a word) spoken by a user), on the basis of information received through an interface with the outside (e.g. a microphone, a camera, a device detecting a physical quantity, such as an odor sensor). In response to the information relating to the external condition, the terminal device 1 can perform processing on a Web page. According to this function, for example, the user may create a desired state as an external state (intentionally or non-intentionally) instead of performing a desired input operation using a mouse or a keyboard, thereby causing the terminal device 1 to recognize the external state (e.g. a character string, which the user desires to select, generated). Thus, the terminal device 1 is caused to perform a predetermined process relating to a Web page. In the description that follows, the household terminal device 1 is provided with a voice recognition function (voice recognition engine) by way of example. Using the voice recognition function, the user of the terminal device 1 inputs voice to the terminal device 1 (instead of manually inputting an URL of a Web page that the use wishes to access, or a corresponding character string), thus becoming able to perform Web surfing. In the case where one or more devices, other than the terminal device 1, are connected in the house, some or all of these devices may have the same function.

[0034] In the example shown in FIG. 1, the household terminal device 1 communicates with the external Web server 2 connected to the Internet 8.

[0035]FIG. 2 shows an example of the internal structure of the household terminal device 1 according to the embodiment of the invention.

[0036] As is shown in FIG. 2, the household terminal device 1 (in the case of performing voice recognition for acquiring information relating to an external state) comprises a Web browser 11, a local Web server 12, a voice recognition engine 13 and a state-monitor processing section 14. Each of the Web browser 11, local Web server 12, voice recognition engine 13 and state-monitor processing section 14 can be realized by a program (or by causing a CPU to execute a program). In addition, some or all of them can be realized by hardware such as semiconductor integrated circuits. In FIG. 2, an input device, a display, a communication apparatus, etc. are omitted.

[0037] The Web browser 11 is, for instance, a browser capable of letting a user view HTML Web pages (e.g. a Java^((R))—capable Web browser in the case of using a Java^((R)) applet as a communication applet to be described later).

[0038] The Web browser 1 has a function of communicating with the local Web server 12, external Web server 2, etc. according to HTTP. In the present embodiment, it is assumed that the Web browser 11 and local Web server 12 are built in the same terminal device 1. Alternatively, they may be provided in separate devices connected via a household network (not shown). (In the former case, intra-apparatus communication (e.g. inter-process communication) is performed, whereas in the latter case communication is performed via a household network.)

[0039] The voice recognition engine 13 (as an example of the processing engine) functions as follows. The voice recognition engine 13 receives voice from a voice microphone (not shown) built in the terminal device 1 or externally connected via a connection terminal of the terminal device 1. Only a meaningful portion of the user's voice is extracted by voice recognition (preferably, high-quality voice recognition) using an internal dictionary (not shown). The extracted result is output as a character string (e.g. a character string of Japanese text).

[0040] In response to a request from a communication applet (to be described later), the state-monitor processing section 14 acquires a processed result of the processing engine (voice recognition engine 13 in this embodiment) provided in the terminal device 1, and returns the acquired processed result to the requesting side. In this embodiment, the state-monitor processing section 14 acquires a word, which has been immediately previously recognized by the voice recognition engine 13, as a character string.

[0041] The local Web server 12 is, for example, a Web server which is set to be able to accept an access from the Web browser 11. The local Web server 12 pre-stores a communication applet (e.g. Java^((R)) applet) 121. The communication applet 121 in this embodiment has a function of communicating with the state-monitor processing section 14 and being capable of receiving the processed result (in this embodiment, a character string recognized by the voice recognition engine 13) of the processing engine which the state-monitor processing section 14 acquires from the processing engine.

[0042] In the case where the local Web server 12 and Web browser 11 are built in the same terminal device 1, the local Web server 12 may be configured to accept only access from the Web browser 11 embedded in the same terminal device 1 (in other words, in the case where the local Web server 12 and Web browser 11 are built in the same terminal device 1, the local Web server 12 may be set such that it cannot accept access from a device other than the same terminal device 1).

[0043] On the other hand, the external Web server 2 shown in FIG. 1 stores Web pages which are assumed to be sent to a terminal such as household terminal device 1.

[0044] It is assumed that these Web pages are described in various formats. FIG. 3 shows an example of a Web page used in the present embodiment. As is shown in FIG. 3, the Web page 21 includes description for downloading a communication applet, e.g., <applet codebase=“http://local Web server/applet/” code=“communication applet.class” width=0 height=0>. In a word, the local Web server in the terminal device 1, in which the Web page 21 is downloaded, is designated in the Web page 21 as a download source of the applet. Note that in FIG. 3, “external Web server” indicates an address (name) of an external server such as “www.xyz.co.jp”.

[0045]FIG. 4 illustrates an example of the processing procedure of the Web browser 11 in the case where acquisition of a Web page is instructed. A designated Web page is acquired (upon a user's instruction to acquire a Web page) (step S1). If a communication applet that designates a download source is embedded in the acquired Web page (step S2), the communication applet is acquired from the designated download source (step S3). The acquired communication applet is executed in association with the Web page. This procedure may be executed prior to display of the acquired Web page, or after completion of display, or independent from display.

[0046]FIG. 5 shows an example of the processing procedure of the application applet 121. The procedure illustrated in FIG. 5 is periodically repeated while the communication applet 121 is running. The state-monitor processing section 14 is accessed (when a time to access the state-monitor processing section 14 has come, for example, on the basis of a timer) (step S11). If the communication applet 121 receives information from the state-monitor processing section 14 to the effect that an empty character string is received or a character string is absent (step S12), no processing is performed. Even where a non-empty character string is received from the state-monitor processing section 14 (step S12), if a URL corresponding to the received character string is not designated, no processing is performed, However, in the case where a non-empty character string is received from the state-monitor processing section 14 (step S12), if a URL corresponding to the received character string is designated, the corresponding Web page is acquired (step S13). In this example, if the corresponding Web page is acquired and displayed, the execution of the communication applet 121 corresponding to the original Web page is suspended.

[0047]FIG. 6 illustrates an example of a processing procedure of the state-monitor processing section 14 in the case where it is accessed by the communication applet 121. The state-monitor processing section 14 accesses the voice recognition engine 13 (when it is accessed by the communication applet 121) (step S21). If the state-monitor processing section 14 receives a non-empty character string from the voice recognition engine 13 (step S22), it returns the character string to the communication applet 121 (step S23). On the other hand, when the state-monitor processing section 14 receives an empty character string or information that there is no character string from the voice recognition engine 13 (step S22), it returns the empty character string or the information that there is no character string to the applet 121 (step S24).

[0048] Referring now to FIG. 7, a process flow from the request for a Web page to the acquisition of the communication applet will be described.

[0049]FIG. 7 shows a state in which the user of the household terminal device 1 accesses, from the Web browser 11 of the terminal device 1, a web page (refer to the Web page shown in FIG. 3) in the external Web server 2, which is designated by

[0050] “http://external Web server/html/startpage.html”.

[0051] To begin with, the user requests the Web page of interest from the external Web server 2 via the Web browser 11.

[0052] The Web browser 11 makes an access request at URL “http://external Web server/html/startpage.html” of the external Web server 12 (see S101 in FIG. 7). Then, the designated Web page 21, i.e. “startpage.html”, is downloaded in the Web browser 11 (see S102 in FIG. 7).

[0053] In the Web page 21, a description of a communication applet, which explicitly designates a download source, like “codebase=“http://local Web server/applet/” code=“communication applet.class”, is embedded. In this case, immediately after downloading the Web page 21, the Web browser 11 accesses the following URL of the local Web server 12 (not of the external Web server 2): http://local Web server/applet/ communication applet.class (see S103 in FIG. 7). As mentioned above, the communication applet 121 is located in this URL. Thus, the communication applet 121 stored in the local Web server 12 is downloaded in the Web browser 11 (see S104 in FIG. 7).

[0054] When the Web page 21 is to be transmitted from the Web server 2 to the household terminal device 1, it is assumed that the external Web server 2 has such information that the local Web server 12 is built in the household terminal device 1, and also has information as to what functions the communication applet 121 located in the local Web server 12 has and information as to the interface, location, etc. of the communication applet 121.

[0055] In general, these pieces of information vary from terminal device to terminal device. For example, cookies may be used as means for pre-storing the information in the external Web server 2. When the Web browser 11 issues to the external Web server 2 a request for acquiring the Web page 21, it sends a cookie at the same time. In the cookie, described is necessary information on functions of the communication applet 121 set in the local Web server 12, and on the interface and location of the communication applet 121. The external Web server 2 can acquire necessary information from the cookie sent from the Web browser 11. If the external Web server 2 stores information on each terminal device as a database, only such information as to discriminate the terminal device may be described in the cookie sent from the Web browser 11.

[0056] The downloaded communication applet 121 is executed on tile Web browser 11. It should be noted, however, that the communication applet 121 is set at a width=0 and a height=0 (refer to the portion in FIG. 3, <applet codebase=“http://local Web server/applet/” code=“communication applet.class” width=0 height=0>), and thus the communication applet 121 is not viewed on the Web browser 11, while it is being executed, and is not recognized by the user. Alternatively, the width and height may be set at values greater than 0, so that the user can visually recognize the communication applet 121.

[0057] For reasons of security, a communication applet is communicable with only the source of download of the communication applet. The communication applet 121 downloaded front the local Web server 12 in the terminal device 1 can communicate with (only) the state-monitor processing section 14 provided in the same terminal apparatus 1 as the local Web server 12, which is the download source, Based on the communication by the communication applet 121, the present embodiment is configured to reflect on the Web browser 11 the information on the external state, i.e. the recognition result of the voice recognition engine 13 in this embodiment, via the state-monitor processing section 14.

[0058] Referring to FIG. 8, a description will now be given of communications between the communication applet 121, state-monitor processing section 14 and voice recognition engine 13, and the function of Web page jump by voice input, as an example of a predetermined process relating to a Web page based on information on an external state.

[0059] For example, the Web page 21 shown in FIG. 3 is displayed on the display screen (not shown) of the Web browser 11 of household terminal device 1. The display screen also displays character strings “apple” (Japanese), “orange” and “strawberry”. If a user (3) selects a given character string (e.g. “apple”), the Web browser 11 accesses the associated server according to the description of a hyperlink (<a href=. . . >in FIG. 3), acquires the corresponding Web page, and displays it on the display screen as a new Web page.

[0060] When the user (3) selects “apple” as a freely chosen character string, the user (3) may manually click a display image portion of “apple” on the display screen using a mouse, or may utter voice “apple” instead of clicking. The voice is recognized by the voice recognition engine 13, and the recognized result is reflected on the Web browser 11 via the state-monitor processing section 14 under the operation of the communication applet 121. Thus, the Web page is jumped to the following page designated by “apple”:

[0061] http://external Web server/html/apple.html.

[0062] Assume that the user (3) of terminal device 1 viewed a Web page displayed on the display screen, and desired to browse a Web page designated by “apple” and uttered “apple”. The voice recognition engine 13 memorizes, as a character string, a word recognized based on the user's uttered voice (refer to S121 in FIG. 8).

[0063] On the other hand, the communication applet 121 periodically perform polling to determine the status of the state-monitor processing section 14 (refer to S122 in FIG. 8). The polling is done, for example, to determine whether there is a recognized character string.

[0064] Each time the state-monitor processing section 14 is accessed by the polling, it inquires of the voice recognition engine 13 as to what word (character string) has been recognized. If the voice recognition engine 13 has a voice recognition result, it stores a character string. In response to the inquiry from the state-monitor processing section 14, the voice recognition engine 13 returns the character string. The state-monitor processing section 14 returns the character string as a response to the communication applet 121 (refer to S123 in FIG. 8). If the voice recognition engine 13 recognizes nothing (i.e. if the voice recognition engine 13 returns information that nothing is recognized, for example, by an empty character string), the state-monitor processing section 14 returns, e.g. an empty character string, thereby notifying the communication applet 121 that nothing is recognized.

[0065] In this communication applet 121, as exemplified by numeral 1211 in FIG. 8, arguments are designated, each of which indicates correspondence between a received character string and an associated URL-designated Web page to be displayed (refer to examples of “apple”, “orange” and “strawberry” in FIG. 3). If a character string, which the communication applet 121 received from the state-monitor processing section 14, is included in these arguments (see S124 in FIG. 8), the communication applet 121 causes the Web browser 11 to display the corresponding URL.

[0066] In the example of FIG. 3, when the voice recognition engine 13 recognizes “apple”, for instance, the communication applet 121 receives a character string “apple” via the state-monitor processing section 14, and a Web page of the following URL is displayed (see S125 in FIG. 8):

[0067] http://external Web server/html/apple.html.

[0068] Similarly, when the word “orange” is recognized, the URL,

[0069] http://external Web server/html/orange.html is displayed.

[0070] When the word “strawberry” is recognized, the URL,

[0071] http://external Web server/html/strawberry.html is displayed.

[0072] On the other hand, the Web page is not changed, if the communication applet 121 receives information that there is no character string, e.g. an empty character string, or the communication applet 121 receives a character string that is not designated by the arguments.

[0073] There are three modes of selection of a desired character string in connection with each Web page: (1) either manual input, e.g. by clicking a character string on the display screen using a mouse, or voice input, (2) only voice input, and (3) only manual input.

[0074] In the above embodiment, the English language is the object of recognition by the voice recognition engine 13. Needless to say, other languages, e.g. Japanese, may be recognized, or one or more of a plurality of languages may be recognized.

[0075] In the above description, URLs are directly associated with “apple”, “orange” and “strawberry”. Alternatively, the display screen may be caused to display “1: apple”, “2: orange” and “3: strawberry”, the user may generate a numeral corresponding to the desired item, and the Web page corresponding to the recognized numeral may be acquired (it is also possible to adopt a method that can cope with either a case where the user generates a numeral corresponding to the desired item or a case where the user generates a character string corresponding to the desired item).

[0076] In the case where a new Web page is acquired and displayed on the Web browser 11 by using the voice recognition, etc., such a new Web page may be associated with a given URL (a description of a communication applet that explicitly designates a download source may, or may not, be embedded in a new Web page). However, because of restrictions due to security of a Java^((R)) applet, the content of a new Web data, which is received from a source other than the download source of the applet, cannot directly be referred to from the communication applet 121 corresponding to the original Web page.

[0077] If the Web page is changed, the communication applet embedded in the original Web page is stopped. However, if the description of the same communication applet is embedded in a new Web page at a destination of transition, e.g. the above-mentioned

[0078] http://external Web server/html/apple.html,

[0079] communication between the same communication applet and state-monitor processing section begins on the Web page at the destination of transition, and the operation can be continued.

[0080] As described above, the communication applet embedded in the Web page communicates with the state-monitor processing section that operates on the Web server at the download source of the communication applet. Thereby, a given web page can dynamically be displayed on the Web browser in accordance with the state (or change of state) detected by the state-monitor processing section.

[0081] As has been described above, even in the method of embedding the description of the communication applet in each Web page, if the same communication applet is used on the Web page at the destination of transition, the communication applet need not be downloaded each time. The reason is that, in general, once a communication applet is downloaded, the duplicate stored in the cache is used in subsequent operations. However, an overhead may possibly occur each time the communication applet is initiated.

[0082] There is a method of avoiding an overhead due to start of the communication applet.

[0083] Specifically, the same communication applet is kept on running. As is shown in FIG. 9A, a page (e.g. first page) is composed of a plurality of pages, e.g. frame 1 and frame 2. The description of the communication applet is embedded in a Web page displayed in frame 1. A new Web page, which is opened according to a change in status, is displayed in frame 2 (the provision of frame 3, . . . , is possible alternatively). With this configuration, frame 1 is unchanged even if the Web page in frame 2 transits to the new page. Thus, the communication applet is not stopped and keeps on running.

[0084] As is shown in FIG. 9B, the frame is not necessarily be displayed as an easily recognizable frame. The frame may be formed with a width=0 or a height=0. In this frame, the communication applet may be run so that the display of the frame may not easily recognized by the user.

[0085] The method of keeping the same communication applet running by using frames is particularly effective for a page structure, as shown in FIG. 10, that is generally adopted in web pages using frames, wherein items of link to each page appears on a sub-frame and the associated content appears on a main frame (in this case, as shown in FIG. 10, the communication applet (resident applet) is run in the sub-frame). The reason is that link destination candidates are understood at the time of first displaying the whole frame including the sub-frame, so they can be registered as arguments at the time of start of the communication applet.

[0086] On the other hand, in the case of a page structure wherein new link destinations are successively displayed each time a new page is opened, as indicated by (a)→(b)→(c) in FIG. 11, all link destinations are not understood at the time of start of the communication applet. In such a case, if continuous execution of the same communication applet is desired, such a configuration as to dynamically pass arguments to the communication applet may be added and the arguments may be used. Alternatively, the communication applet may be embedded in all pages, as mentioned above.

[0087] The above-described method of embedding the description of the communication applet in each Web page or the method of using the frames as illustrated in FIG. 9 may be properly selected and used.

[0088] In the example described above, the Web page, as shown in FIG. 3, in which the descriptions of the communication applet and associated arguments are embedded, is prepared in advance. In an example to be described below, the description of the communication applet is added to the Web page in a device other than the Web server 2, thereby forming the Web page as shown in FIG. 3.

[0089]FIG. 12 shows an example of the structure of this technique. In FIG. 12, a proxy server intervenes between the external Web server 2 and Web browser 11. Before a Web page acquired from the external Web server 2 is displayed, the description of the communication applet is embedded in the Web page by the proxy server 18. If appropriate Web page conversion is not effected, Web surfing by voice input can be realized for a given Web page in a given Web server.

[0090] In this case, for example, a tag <applet> of a communication applet is embedded in a Web page of interest. A set of a character string, which is interposed between link-designating tags,

[0091] <a href= . . . > . . . </a>

[0092] included in the Web page, and a destination of the link, is given as arguments of the communication applet.

[0093] In the example described above, the local Web server is the download source of the communication applet. In an example to be described below, some other device is designated as the download source.

[0094] A freely chosen Web server can be designated as the download source of the communication applet. For example, a server located remote from the household terminal device 1 may be designated, and different Web pages nay be opened according to the state (or change of state) of the remote place.

[0095] The proxy server in FIG. 12 is an example of the server provided within the terminal device. The proxy server, however, may be located anywhere. For example, as shown in FIG. 13, a proxy server may be provided independently at the entrance of a household network (household LAN) provided within the house (in this case, the proxy server being transparently available by the user in the home). Alternatively, a proxy server may be provided en route over the Internet. Alternatively, a proxy server may be provided at the exit of the external server (in this case, the proxy server being transparently available by user who accesses it).

[0096]FIG. 14 shows an example of the network system configuration in this case. A server 4 is located remote from the household terminal device 1. Like the local Web server 12, the server 4 has a function of providing the communication applet 121 to the terminal device 1. In addition, the server 4 includes the same processing engine 43, such as the voice recognition engine, and state-monitor processing section 44 as have been described above. In this case, the communication applet 121, which runs on the Web browser 11 of terminal device 1, communicates with the state-monitor processing section 44 of server 4. In the server 4, the state-monitor processing section 44 communicates with the processing engine 43. Thus, the communication applet 121 in the terminal device 1 can acquire information on the external state of the remote server 4.

[0097] In this case, if HTTP is used for communication between the communication applet 121 and state-monitor processing section 44, the configuration of the state-monitor processing section 44 can be used in such an environment that data communication based on protocols other than HTTP is blocked indoors or outdoors for the purpose of security.

[0098] In the case of using HTTP, the state-monitor processing section 44 may be used as a cgi (common gateway interface). The communication applet 121 sends a request to the cgi as an HTTP request, and receives a character string obtained by the state-monitor processing section 44 as an HTTP response.

[0099] In the case where the state-monitor processing section 44 is remotely located, a traffic between the communication applet 121 and state-monitor processing section 44 may become a problem. In the example shown in FIG. 8, each time the communication applet 121 issues a request, the state-monitor processing section 14 returns some character string generated by the voice recognition engine 13. The communication applet 121 determines, based on the character string, whether transition of the Web page should be made or not. Instead, for example, at the time of activating the communication applet 121, arguments given to the communication applet 121 may be sent to and registered in the state-monitor processing section 44. Even when some processing result (e.g. some character string from the voice recognition engine) is generated by the processing engine 43, the state-monitor processing section 44 may return information that there is no processing result (e.g. the absence of a character string, e.g. by an empty character string) in cases other than the registered arguments (e.g. registered character strings). By such new architecture for registration and deletion of character strings, the traffic can be reduced. This architecture is applicable to the case of FIG. 8.

[0100] In the example that has been mainly described above, a predetermined process relating to a Web page is performed when the acquired information relating to an external state coincides with preset information on an external state. Alternatively, a predetermined process relating to a Web page can be performed on the basis of transition of information relating to an external state. For example, at least information relating to an external state, which is the latest acquired information, is stored. When a transition of an information set of the latest information relating to an external state and the next latest information relating to an external state, which is stored at that time, coincides with a preset transition of information relating to an external state, a predetermined process relating to a Web page may be performed (e.g. a Web page with a URL corresponding to the transition content is acquired). Specifically, assume that URL 1 is designated to a transition “word A→word B”, URL 2 to a transition “word A→word C”, and URL 3 to a transition “word A→word D”. In this case, assume that the user first generates word A and then generates word B in the state that word A is recognized, and the word B is then recognized. Then, a Web page with URL 1 corresponding to the transition “word A→word B” is acquired (i.e. if the user generates word A and word B in this order, the Web page with URL 1 can be acquired and displayed). Other various modifications can also be made.

[0101] In the embodiment described above, the description has been given of the function (processing engine) for acquiring information relating to an external state on the basis of information that the household terminal device 1 receives from the interface with the outside. In addition, the terminal device 1 has been described as being able to perform the process relating to the Web page in response to information relating to the external state. As a concrete example, recognition of voice uttered by the user, as shown in FIG. 15A, was described. Other variations, aside from the voice recognition, are possible with respect to how to acquire information relating to the external state.

[0102] For example, by using an image recognition function as shown in FIG. 15B, an image recognition result can be used as information relating to an external state. There may be various objects of image recognition. An expression produced by the user by, e.g. a sign language or lip reading, is read and a corresponding character string is generated. The generated character string can be used. Besides, the user may perform a predetermined specific action pattern, and the action pattern is recognized and used. Further, predetermined characteristic amounts (e.g. brightness, typical color, rapid motion) may be extracted from an acquired image of the surrounding, and a predetermined process relating to the Web page may be performed based on the extracted result.

[0103] As is shown in FIG. 15C, by using a function of emotion recognition for the user, e.g. guessing delight, anger, sorrow and pleasure of the user at the time, based on the voice recognition of the user's voice and the image recognition of a face image of the user, an emotion recognition result can be used as information relating to the external state.

[0104] As is shown in FIG. 15D, by using an odor recognition function using an odor sensor, an odor recognition result can be used as information relating the external state.

[0105] In the above descriptions, the terminal device 1 connected to the household network is connected to the Internet via the household network. Of course, the invention is not limited to this configuration. In this invention, the terminal device, which is connected to a network other than the household network (e.g. an intranet, a mobile phone service provider's network, or an Internet provider's network) may be connected to the Internet via the network. Furthermore, the invention is also applicable to a terminal device directly connected to the Internet.

[0106] The functions described above are realizable as software.

[0107] Additional advantages and modifications will readily occur to those skilled in the art. Therefore, the invention in its broader aspects is not limited to the specific details and representative embodiments shown and described herein. Accordingly, various modifications may be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents. 

What is claimed is:
 1. A communication apparatus which receives a first Web page from a Web server and displays the first Web page on a screen, comprising: a communication program storing unit configured to store a communication program; a monitoring unit configured to monitor an external state and to generate operation information responsive to the external state; and a Web processing unit which executes the communication program to receive the operation information from the monitoring unit and change the first Web page displayed on the screen to a second Web page according to the operation information.
 2. The apparatus according to claim 1, wherein the monitoring unit generates the operation information including a character string which corresponds to a location of the second Web page.
 3. The apparatus according to claim 1, further comprising a voice recognition unit configured to recognize a voice and send a recognized voice as the external state to the monitoring unit.
 4. The apparatus according to claim 1, wherein the Web processing unit executes the communication program in a first frame and displays at least one of the first Web page and the second Web page in a second frame, and wherein the communication program is resident in the first frame during a page transition between the first Web page and the second Web page.
 5. The apparatus according to claim 4, wherein the first frame is invisible on the screen.
 6. A communication system comprising: a local web server including a communication program storing unit configured to store a communication program and a monitoring unit configured to monitor an external state and to generate operation information responsive to the external state, wherein the communication program storing unit and the monitoring unit are in a single download source regarding security; and a Web processing client which receives a first Web page from a remote Web server and displays the first Web page on a screen and configured to download the communication program from the local web server and execute the communication program to communicate with the monitoring unit in the local web server, receive the operation information from the monitoring unit, and change the first Web page displayed on the screen to a second Web page according to the operation information.
 7. The system according to claim 6, wherein the operation information includes a character string which corresponds to a location of the second Web page.
 8. The system according to claim 6, further comprising a voice recognition unit connected to the monitoring unit and configured to recognize a voice and send a voice recognition result of the recognized voice as the external state to the monitoring unit.
 9. The system according to claim 6, wherein the Web processing client executes the communication program in a first frame and displays at least one of the first Web page and the second Web page in a second frame, and wherein the communication program is resident in the first frame during a page transition between the first Web page and the second Web page.
 10. The system according to claim 9, wherein the first frame is invisible on the screen.
 11. The system according to claim 6, wherein the monitoring unit and the communication protocol communicate with each other in accordance with a HTTP protocol via a HTTP port.
 12. A communication apparatus which communicates with a local web server, comprising: a Web processing unit configured to receive a first Web page from a remote Web server and to display the first Web page on a screen; a program downloading unit configure to download a communication program from the local web server; and a program executing unit configured to execute the communication program to communicate with a monitoring unit in the local web server, which monitors an external state and generates operation information responsive to the external state, receive the operation information from the monitoring unit, and change the first Web page displayed on the screen to a second Web page according to the operation information.
 13. The apparatus according to claim 12, wherein the operation information includes a character string which corresponds to a location of the second Web page.
 14. The apparatus according to claim 12, wherein the Web processing unit executes the communication program in a first frame and displays at least one of the first Web page and the second Web page in a second frame, and wherein the communication program is resident in the first frame during a page transition between the first Web page and the second Web page.
 15. The system according to claim 14, wherein the first frame is invisible on the screen.
 16. A computer system for communicating with a local web server, comprising: a processor; a memory accessible to the processor; and a software application stored on the memory, wherein the software application comprises: first program code means for receiving a first Web page from a remote Web server and for displaying the first Web page on a screen; second program code means for downloading a communication program from the local web server; and third program code means for executing the communication program to communicate with a monitoring unit in the local web server, which monitors an external state and generates operation information responsive to the external state, receive the operation information from the monitoring unit, and change the first Web page displayed on the screen to a second Web page according to the operation information.
 17. The computer system according to claim 16, wherein the operation information includes a character string which corresponds to a location of the second Web page.
 18. The computer system according to claim 16, wherein the first program code means causes the processor to execute the communication program in a first frame and causes the processor to display at least one of the first Web page and the second Web page in a second frame, and wherein the communication program is resident in the first frame during a page transition between the first Web page and the second Web page.
 19. The computer system according to claim 18, wherein the first frame is invisible on the screen.
 20. A computer implemented method of processing a Web page, comprising: receiving a first Web page from a Web server by a Web processing unit; acquiring a communication program identified by the first Web page; displaying the first Web page on a screen by the Web processing unit; monitoring an external state to generate operation information responsive to the external state; executing the communication program by the Web processing unit to change the first Web page displayed on the screen to a second Web page according to the operation information.
 21. The method according to claim 20, wherein the operation information includes a character string which corresponds to a location of the second Web page.
 22. The method according to claim 20, wherein the Web processing unit executes the communication program in a first frame and displays at least one of the first Web page and the second Web page in a second frame, and wherein the communication program is resident in the first frame during a page transition between the first Web page and the second Web page.
 23. The method according to claim 22, wherein the first frame is invisible on the screen. 